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Abstract The snakes comprising the monophyletic group referred to as ratsnakes are found throughout Asia, Europe 
and the New World. Recently, three snake samples likely belonging to the ratsnakes were collected in Zoige County, 
Sichuan Province, China. Species identity was difficult to delimit morphologically because the specimens were juve- 
niles and partially damaged. Subsequently, a molecular phylogenetic approach was used. Portions of three mitochon- 
drial genes (cyt b, ND4 and 12S rRNA) were sequenced and analyzed. The results showed that they were sister to the 
genus Elaphe. Very little genetic variation was found among the three samples. The minimum genetic distances be- 
tween these samples and those within Elaphe were greater than any currently recognized species within the genus. We 
conclude that this likely represents a new species within the genus Elaphe. Adult specimens and a morphologic descrip- 


tion are needed for further study. 
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1. Introduction 


The systematics of the group referred to as ratsnakes have 
been studied for more than a century but recently this 
group has been restricted to include the Old World genera 
Coelognathus, Gonyosoma, Rhadinophis, Maculophis, 
Euprepiophis, Oreophis, Orthriophis, Elaphe, Zamenis, 
Rhinechis, Oocatochus and Coronella as well as 9 genera 
in the New World tribe Lampropeltini (Utiger et al., 2002; 
Burbrink and Lawson, 2007). The ratsnakes likely origi- 
nated in the mid-Eocene in southeastern Asia, currently 
occurs throughout Asia, Europe and the New World, and 
is composed of 72 species (Schulz, 1996; Burbrink and 
2007;  Burbrink and 2010; 


Lawson, Pyron, 
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www.reptile-database.org). Within China, there are 15 
known species belonging to six genera (species numbers 
in parenthesis): Elaphe (6), Rhadinophis (2), Orthriophis 
(3), Euprepiophis (2), Oreophis (1), and Coelognathus (1) 
(Zhao, 2006). However, it is likely that given the large 
area of this country, with differed landforms and diverse 
habitats, that more taxa have evaded discovery. For in- 
stance, the Zoige Grassland in northwestern Sichuan is 
located within the northeastern part of the Tibetan Plateau. 
In this region, only two snake species have been docu- 
mented: Gloydius strauchi and Elaphe dione (Zhao, 
2003). Furthermore, it is likely that given the general 
cryptic nature of snakes that more species occur in this 
area (Ursenbacher et al., 2006; Huang et al., 2009). 

In August 2008, two neonate snakes (NS1, NS2) and 
one juvenile snake (JS) were collected in Zoige County 
(specific sites unknown) by local farmers. The juvenile 
was found dead on road (DOR) and the forepart of its 
body was crushed, making its positive identification 1m- 
possible. Furthermore, identifying the exact species and 
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genus of these three specimens was difficult based on the 
available material. Fortunately, muscle tissue was pre- 
served in 95% ethanol, which permitted molecular phy- 
logenetic analyses to be conducted in order to identify the 
genus of the specimens. 

Over the last two decades, the analyses of mitochon- 
drial DNA genes have successfully been used as a versa- 
tile and effective tool to investigate the phylogenetic rela- 
tionships of snakes (Huang et al. 2009). The most fre- 
quently used fragments are mitochondrial cytochrome b 
(cyt b) and/or NADH dehydrogenase subunit 4 (ND4) 
genes (e.g. Kraus and Brown, 1998; Kelly et al., 2003; 
Malhotra and Thorpe, 2004; Lawson et al., 2005; Burbrink 
and Lawson, 2007). Using these gene fragments, Huang et 
al. (2009) successfully found the closest related living rela- 
tives of the hot-spring snakes (Colubridae, Thermophis), 
the relict species endemic to the Tibetan Plateau. 

This study aims to first determine which subfamily 
these three samples would belong to within Caenophidia 
by using the datasets of Huang et al. (2009); and then 
with the result, to further examine the relationships 
within the subfamily by using more samples from that 
group. The results from this study suggest that the sam- 
ples are most likely allied to the ratsnakes, particularly 
the genus Elaphe. 


2. Material and methods 


2.1 Dataset construction The specimens (NS1, NS2, 
and JS) are deposited in the Museum of the Sichuan 
Academy of Forestry (field numbers ZJ200801-03). Total 
genomic DNA was extracted with phenol/chloroform as 
described in Sambrook et al. (1989). Complete cytochrome 
b (cyt b), partial ND4, and partial 12S rRNA genes were 
amplified from the total DNA extracts using the poly- 
merase chain reaction (PCR) with primer pairs 
L14910/H16064 (Burbrink et al., 2000) ND4/Leu 
(Arévalo et al., 1994), and L1091mod/H1557mod (Zaher 
et al., 2009), respectively. The PCR reaction contained 
approximately 100 ng of template DNA, 1 uL of each 
primer, 5 uL of x10 reaction buffer, 2 uL dNTPs (each 2.5 
mM), and 2.0 units of Taq DNA in a 50 uL reaction. Reac- 
tions were cycled at the following temperatures: an initial 
94°C denaturation for 3 min, 30 cycles of 94°C denatura- 
tion for 1 min, 46—55'C annealing for 1 min, 72°C exten- 
sion for 1 min, and a final 72°C extension for 5 min. 
Genes were purified from 0.8% low-melting agarose gels 
using the BioStar Glassmilk DNA purification Kit ac- 


cording to manufacturer's instructions. The purified DNA 
was sequenced with the Perkin-Elmer BigDye DNA Se- 
quencing Kit according to the manufacturer's protocol 
with the primers used in PCR. 

To determine the phylogenetic position of the three 
samples, we used the three datasets from Huang et al. 
(2009) which contained cyt b sequences for 130 species, 
ND4 sequences for 112 species, and a combined cyt b 
and ND4 dataset for 85 species. From these datasets, we 
determined that the snakes were likely related to the rats- 
nakes. Therefore, we downloaded these genes for 34 spe- 
cies of ratsnakes from GenBank which were used to fur- 
ther determine the specific phylogenetic relationships of 
these unidentified samples (Table 1). Of the 10 species of 
Elaphe (The closest relatives of the three samples, see 
Results) available from GenBank, E. davidi, E. sauro- 
mates and E. anomala were not available for cyt b and 
ND4. However, we acquired 17 sequences of partial 12s 
rRNA , including all the 10 species of Elaphe, that is, 3 
from E. bimaculata, 6 from E. dione and 1 from each of 
the rest 8 species. Additionally, 14 other genera of rats- 
nakes were downloaded from GenBank (Table 1). 


2.2 Phylogenetic inference Sequences for cyt b, ND4 
and 12S rRNA were aligned with other retrieved se- 
quences for the same gene regions respectively, and to the 
combined gene sequences using Clustal X (Thompson et 
al., 1997), and further proofread by eye. Bayesian infer- 
ence (BI) and maximum likelihood (ML) approaches 
were used to infer phylogenies using MrBayes v3.0 
(Huelsenbeck and Ronquist, 2001) and RAxML Black- 
Box  (http:;//phylobench.vital-it.ch/raxml-bb/index.php; 
Stamatakis et al., 2008). Modeltest 3.7 (Posada and 
Crandall, 1998) was used to determine the model of nu- 
cleotide substitution under the Akaike Information Crite- 
rion (AIC) for ML and Bayesian Information Criterion 
(BIC) for BI. In the Bayesian analysis, a starting tree was 
obtained using the neighbor-joining algorithm. Posterior 
probabilities (PP) were obtained by Markov chain Monte 
Carlo (MCMC) analysis with one cold chain and three 
heated chains. Samples of trees and parameters were 
drawn every 100 steps from a total of 1,000,000 MCMC 
steps. The first 100,000 generations (1,000 trees) were 
discarded as burn in (representing the generations in which 
the chain had not yet reached stationarity) and only the 
results from the last 900,000 generations (9,000 trees) were 
used to compute a consensus tree. We ran three additional 
analyses starting with random trees. The consensus of all 
the post-burn generations (3,600,000 generations result- 
ing in 36,000 trees) was estimated from all four runs. 
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Table1 Sequences of cyt b, ND4 and 12S rRNA genes from the ratsnakes and Ptyas (outgroup) retrieved from GenBank 


Accession No. 


Genus and species Sources 
cyt b ND4 12S rRNA 
Old world 
Coronella austriaca AY486930 AY487065 Nagy et al. (2004) 
Coronella girondica AF471088 AY487066 AY 122835 Lawson et al. (2005); Utiger et al. (2002) 
Elaphe anomala AY 122803 Utiger et al. (2002) 
Elaphe bimaculata DQ902104 DQ902283 Burbrink and Lawson (2007) 
Elaphe bimaculata AY 122837 Utiger et al. (2002) 
Elaphe bimaculata AY 122767 Utiger et al. (2002) 
Elaphe bimaculata AY 122768 Utiger et al. (2002) 
Elaphe carinata DQ902133 DQ902284 AY 122838 Burbrink and Lawson (2007); Utiger et al. (2002) 
Elaphe climacophora DQ902105 DQ902285 AY122772 Burbrink and Lawson (2007); Utiger et al. (2002) 
Elaphe davidi AY122775 Utiger et al. (2002) 
Elaphe dione DQ902107 DQ902287 Burbrink and Lawson (2007) 
Elaphe dione AY122834 Utiger et al. (2002) 
Elaphe dione AY122832 Utiger et al. (2002) 
Elaphe dione AY122833 Utiger et al. (2002) 
Elaphe dione AY122830 Utiger et al. (2002) 
Elaphe dione AY122831 Utiger et al. (2002) 
Elaphe dione AY122829 Utiger et al. (2002) 
Elapheb quadrivirgata DQ902120 DQ902300 AY122794 Burbrink and Lawson (2007) ; Utiger et al. (2002) 
Elaphe quatuorlineata AY486931 AY487067 AY122796 Nagy et al. (2004) ; Utiger et al. (2002) 
Elaphe sauromates AY122797 Utiger et al. (2002) 
Elaphe schrenckii DQ902124 DQ902302 AY122804 Burbrink and Lawson (2007); Utiger et al. (2002) 
Euprepiophis conspicillatus DQ902106 DQ902286 Burbrink and Lawson (2007) 
Euprepiophis mandarinus DQ902115 DQ902294 AY122784 Burbrink and Lawson (2007); Utiger et al. (2002) 
Oocatochus rufodorsatus DQ902123 DQ902301 AY122800 Burbrink and Lawson (2007); Utiger et al. (2002) 
Oreophis porphyraceus DQ902118 DQ902298 AY122790 Burbrink and Lawson (2007); Utiger et al. (2002) 
Orthriophis cantoris DQ902135 DQ902315 Burbrink and Lawson (2007) 
Orthriophis hodgsonii DQ902136 DQ902318 Burbrink and Lawson (2007) 
Orthriophis moellendorffi DQ902116 DQ902295 Burbrink and Lawson (2007) 
Orthriophis taeniurus EF076709 DQ902305 AY122809 Burbrink and Lawson (2007); Utiger et al. (2002) 
Rhinechis scalaris AY486932 AY487068 AY122801 Nagy et al. (2004); Utiger et al. (2002) 
Zamenis hohenackeri DQ902137 DQ902320 AY122779 Burbrink and Lawson (2007); Utiger et al. (2002) 
Zamenis situla DQ902125 DQ902303 Burbrink and Lawson (2007) 
NS1 and NS2 (NSs) HQ330524 HQ330526 HQ330522 This study 
JS HQ330525 HQ330527 HQ330523 This study 
Ptyas korros AY486929 AY487062 Nagy et al. (2004) 
Ptyas mucosus AF471054 AY487063 Lawson et al. (2005) 
New world 
Arizona elegans DQ902101 DQ902279 AY 122810 Burbrink and Lawson (2007); Utiger et al. (2002) 
Bogertophis subocularis DQ902103 DQ902281 Burbrink and Lawson (2007) 
Bogertophis rosaliae DQ902102 AF 138751 AY 122815 Burbrink and Lawson (2007); Utiger et al. (2002) 
Lampropeltis getula FJ997848 AF 138759 Pyron et al. (2009) 
Lampropeltis alterna AF337130 AY497307 Fetzner et al. (Unpubl.) 
Lampropeltis calligaster DQ902129 DQ902311 Burbrink and Lawson (2007) 
Lampropeltis ruthveni AY 122820 Utiger et al. (2002) 
Pseudelaphe flavirufa DQ902109 DQ902289 AY 122841 Burbrink and Lawson (2007); Utiger et al. (2002) 
Pantherophis obsoletus AF283644 DQ902296 AY 122843 Burbrink et al. (2000) ; Utiger et al. (2002) 
Pantherophis vulpinus AF283638 AF 138758 Burbrink et al. (2000) 
Pantherophis bairdi AF283599 AF 138755 Burbrink et al. (2000) 
Pantherophis guttatus DQ902111 DQ902291 Burbrink and Lawson (2007) 
Pituophi deppei AY 122827 Utiger et al. (2002) 


Senticolis triaspis DQ902127 AF 138775 AY 122848 Burbrink and Lawson (2007); Utiger et al. (2002) 
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In the phylogenetic analyses, a low character to termi- 
nal taxa ratio (i.e., short sequences and many taxa) tended 
to result in low node supports, especially for the internal 
nodes (Kelly et al., 2003). When the relatives of the three 
samples were found, their sequences should be grouped 
as “small datasets" to increase the character to taxa ratio, 
in order to further understand the relationship, particu- 
larly which species the three samples would belong to. 
Using RaxML v7.2.6, we estimated the ML tree and 
1,000 nonparametric bootstraps (BS) using the GTR+I+G 
model for cyt b and ND4 combined sequences. The un- 
corrected p-distances within the “small datasets" were 
calculated using MEGA 4 (Tamura et al., 2007). 


3. Results 


3.1 Sequence characteristics Alignment by eye of the 


cyt b, ND4 and 12s rRNA sequences of the three samples 
generated by this study resulted in 1015, 696, and 428 
sites, respectively. The two neonate snakes (NS1 and NS2) 
shared identical haplotypes for cyt b, ND4 and 12s rRNA, 
respectively. 

The percentages of the four nucleotides were relatively 
similar across Caenophidiea, ratsnakes, Elaphe, and the 
new snakes, and showed strong bias against guanidine, 
typical for mitochondrial DNA (de Queiroz et al. 2002; 
Malhotra and Thorpe, 2004; Huang et al. 2009; Table 2). 
While phylogenetic results (See below) suggest that the 
unidentified snakes are closely related to Elaphe, uncor- 
rected p-distances among the three genes reveal that the 
new taxa attain the maximum difference within the genus 
(Table 3). 


3.2 Phylogenetic analyses To find the systematic posi- 
tion of NSs and JS in Caenophidia, three datasets (cyt b, 


Table2 Average nucleotide composition (%), numbers of variable (V) and parsimony informative (PI) sites for cyt 5, ND4 and 12S rRNA 


gene fragments, based on the data used in the present study. 


Cytb (1 015 aligned sites) ND4 (696 aligned sites) 12S rRNA(428 aligned sites) 
A T C A GVA Peu A G y BI T C o xd. oC BI 
Caenophidia n-132 27.6 29.8 31.6 11.0 718 645 n=114 25.5 29.6 334 11.5 471 414 
Ratsnakes n=36 28.0 28.1 32.9 11.0 491 E n-36 26.7 27.6 344 11.3 355 any n=33 20.4 24.5 37.8 17.3 140 d 
Elaphe n-9 27.0 29.1 32.9 11.0 302 184 n=9 25.9 29.1 33.6 11.4 221 140 n=19 20.2 24.6 37.8 17.4 94 67 
NSsandJS n-2 26.0 30.8 32.0 112 1 n-2 24.7 29.7 340 11.5 1 n=2 19.8 253 37.6 173 2 
n: Numbers of sequences of each group. 
%: Pl/aligned sites 
Table3 Sequence divergences between species of Elaphe for cyt b, ND4 and 12S rRNA sequences 
Uncorrected p-distance 
cyt b ND4 12S rRNA 
Taxa Max Min Mean Max Min Mean Max Min Mean 
Ratsnakes 0.181 0.067 0.138 (n=33) 0.187 0.067 0.136 (n-33) 0.139 0.039 0.086 (n=25) 
Elaphe 0.132 0.096 — 0.117(n-7) — 0.143 0.108 — 0.126(n-7) 0.104 0.042 0.068 (n-11) 
NSs vs. JS 0.001 (n2) 0.002 (n=2) 0.005 (n=2) 
NS/JS. vs. Ratsnakes 0.175 0.107 0.142 (n=33) 0.172 0.111 0.130 (n=33) 0.111 0.067 0.085 (n=25) 
NS/IS. vs. Elaphe 0.142 0.117 0.129(n=8) — 0.140 0.127 0.132(n-8) 0.104 — 0.069 0.088 (n= 11) 


*: Despite the distance of 0.005 between E. anomala and E. schrenckii, they may be conspecies, or the former is a subspecies of the latter (Pope, 1935; 


Utiger et al., 2002). 
n: Numbers of sequences of each group. 
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NDA, and cyt b +ND4) from Huang et al. (2009) were 
used with BI and ML approaches. These topologies 
were similar to those presented in Huang et al. (2009). 
The basic relationships of the commonly recognized 
families and/or subfamilies are similar: Acrochordus 
(Viperidae (Atractaspidinae, Boodontinae, Elapinae, 
Hydrophiinae, Psammophiinae, Pseudoxyrhophiinae), 
(Colubrinae, Natricinae, Xenodontinae)) This basic 
relationship is the same as that of Kelly et al. (2003) and 
Lawson et al. (2005). In our trees, NS1, NS2 and JS fell 
within the ratsnakes (including the Old World genera 
Coronella, Elaphe, Euprepiophis, Oocatochus, Oreophis, 
Orthriophis, Rhinechis, Zamenis, and New World gen- 
era Bogertophis, Lampropeltis, Arizona, Pseudelaphe, 
Pantherophis, Senticolis; Utiger et al., 2002; Burbrink 
and Lawson, 2007) with high (BI trees) and moderate 
(RAxML trees) supports (Table 4). 

The topologies of the ratsnakes were similar among 
the individual mtDNA genes and their combination, ex- 
cept with respect to the uncertainty of the monophyly of 
the genus Orthriophis. The three samples (NS1, NS2 and 
JS) are placed, with high (PP) or moderate (BS) support, 
as sister to Elaphe (Table 4). Due to the difficulty in 
aligning 12S rRNA, the sequences were found to yield 
trees with poor resolution. 


4. Discussion 


Comprehending the diversity of snakes within any region 
has traditionally relied on the analyses of morphological 
data. However, for many species of snakes, morphologi- 
cal data have proven difficult to uncover cryptic diversity 
and to understand the relationships among taxa, due in 
part to their specialized and simple body structures 
(Huang et al. 2009; Burbrink and Crother, 2010). There- 
fore, understanding the relationships among snakes glob- 


ally and regionally has excelled in the last decade due to 
molecular phylogenetic methods (Lawson et al., 2005; 
Huang et al., 2009; Burbrink and Crother, 2010). 

Utiger et al. (2002) examined the phylogenetic rela- 
tionships of the Holarctic ratsnakes (Zlaphe auct.) using 
portions of sequences from two mitochondrial genes, 12s 
rRNA and CO1. They proposed some taxonomic changes, 
established two new genera (Orthriophis and Oreophis, 
gen. nov.), and concluded that Elaphe (sensu stricto) 
Fitzinger, 1833 is made up of ten Palaearctic species: E. 
sauromates (type species), E. anomala, E. bimaculata, E. 
carinata, E. climacophora, E. davidi, E. dione, E. 
quadrivirgata, E. quatuorlineata, and E. schrenckii. Their 
conclusions are generally verified in another study using 
more data and taxa (Burbrink and Lawson, 2007). 

The three enigmatic samples in the present study al- 
ways form a monophyletic clade and are closely related 
to Elaphe, and possibly are the sister species to the re- 
maining members of the genus (Figure 1, Table 4). The 
very small genetic distance among the three specimens 
(0.001 for cyt b, 0.002 for ND4, 0.005 for 12s rRNA, see 
Table 3) suggest they are part of the same population. 
Additionally, the minimum genetic distance between 
these samples and Elaphe is always greater than that 
among the species within this genus: 0.117>0.096 for cyt 
b, 0.127>0.108 for ND4, and 0.069>0.042 for 12s rRNA 
(Table 3). 

We conclude that these specimens likely represent a 
yet undescribed species of Elaphe, elevating the number 
of species in this genus to 11 and the number of total 
ratsnakes in China to 16. This would also elevate the 
number of snakes in the Zoige to 3. We anticipate ac- 
quiring additional material to formally describe this new 
taxon, characterize its mensural and meristic, and mor- 
phological variation, estimate its distribution range in 
western China, and describe its general ecology. 


Table4 Support of the monophyly of Elaphe (including NSs and JS) and the ratsnakes in phylogenetic trees based on various datasets 


Ratsnakes in Caenophidia 


Elaphe within the ratnsakes 


Cyt b (n=132)  ND4(n-114) Cytb + ND4 (n=87) Cytb (n=36) ND4 (n=36) Cytb + ND4 (n=36) 
Posterior Prob. 98 100 98 100 74 100 
Bootstraps 68 - 75 76 - 85 


n: Numbers of sequences of each dataset. 
—: < 50. 
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100 
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100 
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100 
100 


Pantherophi obsoletus 
Pantherophi bairdi 
Pantherophi vulpinus 
Pantherophi quttatus 
Bogertophi rosaliae 
Pseudelaph flavirufa 

100 Lampropelti qetula 
100 Lampropelti alterna 

Lampropelti calliqaster 


Bogertophi subocularis 


Arizonaeleqans 


Rhinechi scalaris 
Zameni situla 
Zameni hohenackeri 
Coronell austriaca 
Oocatochu rufodorsatus 
Coronell qirondica 
Elaph quadrivirqata 
Elaph schrenckii 
Elaph carinata 
Elaph climacophora 
Elaph bimaculata 
Elaph dione 
Elaph quatuorlineata 
TA = Elaphe sp. ? 
Euprepiophi conspicillatus 
Euprepiophi mandarinus 
Oreophi porphvraceus 
Orthriophi cantoris 
Orthriophi hodqsonii 
Orthriophi moellendorffi 


Orthriophi taeniurus 


Ptva mucosus 


Ptva korros 


Senticoli triaspis 


Figure 1 The 50% majority rule consensus tree from the Bayesian analysis based on cyt b and ND4 combined sequences. Posterior 
probability values and RAxML supports are shown above and below the corresponding nodes (under 5096 from both analyses omitted). 
Likelihood settings from best-fit model (GTR+I+G) selected by BIC in Modeltest 3.7 (-In L=19544.7363). Base- (0.3799 0.3230 
0.0781), Nst=6, Rmat=(0.7682 9.9929 1.4101 0.3322 20.6632), Rates-gamma, Shape-0.6621, Pinvar-0.4522. Red: Mainly eastern 
Palaearctic or Oriental distribution; Yellow: Mainly western Palaearctic distribution; Green: Mainly New World distribution (according 


to Utiger et al., 2002). 
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